A Fast and Provable Method for Estimating Clique Counts Using Turán's Theorem

نویسندگان

  • Shweta Jain
  • Seshadhri Comandur
چکیده

Clique counts reveal important properties about the structure of massive graphs, especially social networks. The simple setting of just 3-cliques (triangles) has received much attention from the research community. For larger cliques (even, say 6-cliques) the problem quickly becomes intractable because of combinatorial explosion. Most methods used for triangle counting do not scale for large cliques, and existing algorithms require massive parallelism to be feasible. We present a new randomized algorithm that provably approximates the number of k-cliques, for any constant k. The key insight is the use of (strengthenings of) the classic Turán’s theorem: this claims that if the edge density of a graph is sufficiently high, the k-clique density must be non-trivial. We define a combinatorial structure called a Turán shadow, the construction of which leads to fast algorithms for clique counting. We design a practical heuristic, called Turán-shadow, based on this theoretical algorithm, and test it on a large class of test graphs. In all cases, Turán-shadow has less than 2% error, in a fraction of the time used by well-tuned exact algorithms. We do detailed comparisons with a range of other sampling algorithms, and find that Turán-shadow is generally much faster and more accurate. For example, Turán-shadow estimates all cliques numbers up to size 10 in social network with over a hundred million edges. This is done in less than three hours on a single commodity machine. CCS Concepts •Theory of computation → Social networks; •Mathematics of computing → Extremal graph theory;

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Turán's theorem in sparse random graphs

We prove the analogue of Turán’s Theorem in random graphs with edge probability p(n) À n−1/(k−1.5). With probability 1 − o(1), one needs to delete approximately 1 k−1–fraction of the edges in a random graph in order to destroy all cliques of size k.

متن کامل

On Turáns theorem for sparse graphs

a?nl(t+1) where a denotes the maximum size of an independent set in G. We improve this bound for graphs containing no large cliques. 0. Notation n=n(G)=number of vertices of the graph G e=e(G)=number of edges of G h=h(G)=number of triangles in G deg (P)=valency (degree) of the vertex P deg, (P) = triangle-valency of P=number of triangles in G adjacent to P t=t(G)=n f deg (P) = 2eln = average va...

متن کامل

Edit Distance and its Computation

In this paper, we provide a method for determining the asymptotic value of the maximum edit distance from a given hereditary property. This method permits the edit distance to be computed without using Szemerédi’s Regularity Lemma directly. Using this new method, we are able to compute the edit distance from hereditary properties for which it was previously unknown. For some graphs H, the edit ...

متن کامل

Turán’s theorem with colors

We consider a generalization of Turán’s theorem for edge-colored graphs. Suppose that R (red) and B (blue) are graphs on the same vertex set of size n. We conjecture that if R and B each have more than (1− 1/k)n/2 edges, and K is a (k+ 1)-clique whose edges are arbitrarily colored with red and blue, then R ∪B contains a colored copy of K, for all k + 1 6∈ {4, 6, 8}. If k + 1 ∈ {4, 6, 8}, then t...

متن کامل

New Results on k-Independence of Graphs

Let G = (V,E) be a graph and k > 0 an integer. A k-independent set S ⊆ G is a set of vertices such that the maximum degree in the graph induced by S is at most k. Denote by αk(G) the maximum cardinality of a k-independent set of G. For a graph G on n vertices and average degree d, Turán’s theorem asserts that α0(G) > n d+1 , where the equality holds if and only if G is a union of cliques of equ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017